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Abstract 

We demonstrate an equivalence between the rank 2 fragments of the 
polymorphic lambda calculus (System F) and the intersection type dis- 
cipline: exactly the same terms are typable in each system. An imme- 
diate consequence is that typability in the rank 2 intersection system 
is DEXPTIME-complete. We introduce a rank 2 system combining 
intersections and polymorphism, and prove that it types exactly the 
same terms as the other rank 2 systems. The combined system sug- 
gests a new rule for typing recursive definitions. The result is a rank 2 
type system with decidable type inference that can type many exam- 
ples of polymorphic recursion. Finally, we discuss some applications of 
the type system in data representation optimizations such as unboxing 
and overloading. 

Keywords: Rank 2 types, intersection types, polymorphic recursion, 
boxing/unboxing, overloading. 



1 Introduction 

In the past decade, Milner's type inference algorithm for ML has become 
phenomenally successful. As the basis of popular programming languages 
like Standard ML and Haskell, Milner's algorithm is the preferred method 
of type inference among language implementors. And in the theoretical 
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community, the literature on type inference is dominated by extensions of 
ML's let-polymorphism. 

In this paper we examine some alternatives to ML that have attracted 
surprisingly little attention: the systems of rank 2 types introduced by 
Leivant [21]. These systems are slightly more powerful than ML — strictly 
more terms can be assigned types — and the increased power comes for free — 
the complexity of typability is identical. But the unique feature of the rank 2 
systems that justifies further study is that, in sharp contrast to other exten- 
sions of ML, they abandon let-polymorphism. 

We use the expression (Xx.xx) to illustrate the limitations of let-poly- 
morphism. It is well known that this expression cannot be typed in ML: 
the only way for ML to type the self- application xx is by assigning a poly- 
morphic type to x, and ML does not allow abstraction over variables with 
polymorphic type. In ML, the only mechanism for introducing variables of 
polymorphic type is the let-expression: 

let x = (Xy.y) 
in xx 

This let-expression binds x to the identity function (Xy.y), which has the 
polymorphic type \ft.t — ► tin ML. By ML's let-polymorphism, x is assigned 
the type \ft.t — ► t, which is sufficient to type xx. 

The problem with this is that we cannot typecheck the uses of x (the 
application xx) separately from its definition (the function (Xy.y)). So ML 
must be extended with a module language in order to support programming 
in the large, where it is impractical to require every polymorphic definition 
to appear in the same source file as every use. 

In contrast, (Xx.xx) is typable in all of the rank 2 systems we consider. 
Here are two rank 2 typings: 

(Xx.xx) : (\ft.t —►£)—► (\fs.s — ► s), 

(Xx.xx) : (t -+ t) A ((t -+ t) -+ (t -+ t)) -►(/-► t). 

The first typing says that (Xx.xx) is a function that, when given an argument 
with type t — ► t for any type t, produces a result with type s — ► s, for any s. 
The identity function is an appropriate argument. 

The second typing says that (Xx.xx) is a function that, when given an 
argument having both the types (t —^ t) and (i — ► i) — ► (i — ► i), produces 
a result of type (t —^ t). Once again, the identity (Xy.y) is an appropriate 
argument. 



The rank 2 systems we consider are subsystems of two widely studied 
type systems, System F and the system of intersection types. System F, 
introduced independently by Girard [7] and by Reynolds [28], predates ML 
and can type many more terms. A recent result of Wells [34], however, shows 
that typability in the system is undecidable, putting type inference out of 
reach. 

The system of intersection types, introduced independently by Coppo 
and Dezani [5] and by Salle [29], can type even more terms than System F: 
it types all (and only) the strongly normalizing terms. 1 The equivalence of 
typability and strong normalization implies that type inference, just as with 
System F, is unattainable. 

With the goal of type inference in mind, we seek decidable restrictions of 
these type systems. Restrictions based on the rank of types were suggested 
by Leivant [21]. The rank of a type can be easily determined by examining it 
in tree form. A type is of rank k if no path from the root of the type to a type 
constructor of interest (either type intersection 'A' or type quantification 'V') 
passes to the left of k arrows. The types shown in Figure 1 are rank 2 types, 
because no path from root to A or V passes to the left of two arrows. But 
the types shown in Figure 2 go beyond rank 2 (they are rank 3 types). The 
types given above for (Xx.xx) are rank 2 types. 
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((t A(t^s))^t)As s -* (Vt.t -^t) -* s 

Figure 1: Examples of rank 2 types 
Ranks and 1 of Leivant 's systems are equivalent to the simply typed 
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((t A s) -+ t) -+ s -► (\/t.t) -► s) -► s 

Figure 2: Types that go beyond rank 2 

lambda calculus, which can type fewer terms than ML. But starting with 
rank 2, the systems can type more terms than ML. 

Rank 2 of System F, which we call A2, has received the most study. Mc- 
Cracken [23] proposed a type inference algorithm for A2 based on Leivant's 
ideas. This algorithm is incorrect. Kfoury and Tiuryn [12] show that the 
complexity of typability in A2 is identical to that of ML. Kfoury and 
Wells [16, 17] give a correct type inference algorithm, and show that ranks 3 
and higher in System F are undecidable. 

Leivant's original paper is almost the only work on rank 2 of the inter- 
section type discipline, which we call I2. Leivant sketched a type inference 
algorithm for I2, but the algorithm was not formalized and proved correct 
until recently [33]. Leivant also conjectured the undecidability of ranks 3 
and higher in the intersection system; to our knowledge the details of his 
proof idea have never been verified. 

1 2 has a significant advantage over A2: it has principal typings. This 
means that for any term M, if M is typable in I2, then there is an 1 2 typing 
judgment 

AY- M : a 

that represents all of the possible typing judgments for M. Other typings 
for M can be obtained from the principal typing by simple operations (sub- 
stitution and subsumption). 



Contributions of the paper 

Since 1 2 has principal typings, and A2 does not, we believe 1 2 deserves 
more study. The first contribution of this paper is to develop some of the 
basic properties of I2. We establish the following equivalence: a term is 
typable in 1 2 if and only if it is typable in A2. 2 An immediate corollary 
is that typability in I2 is DEXPTIME-complete, identical to typability in 
A2 and ML. We also consider some variants of I2, and show they are all 
equivalent in terms of typability. 

The second contribution of this paper is to introduce a new type system, 
P2, that combines rank 2 intersection types and top-level quantification of 
type variables, as in ML. P2 has principal typings, so it clearly improves on 
A2. Its advantage over 1 2 is more subtle. The addition of quantifiers makes 
types more expressive: the quantifiers identify generic type variables, that is, 
type variables which can safely be instantiated with any type. This permits 
a simpler definition of the type inference algorithm, and suggests a novel 
type inference algorithm for recursive definitions. 

A recursive definition is written in the form (jixM), and is meant to 
denote a program x such that 

x = M, 

where M may contain some uses of a;. The standard rule for typing recursive 

definitions looks like 

A U {x : a} h M : a 

A \- (fj,xM) : a 

Most type inference algorithms restrict the type a in this rule to be a simple 
type. The rule of polymorphic recursion relaxes this restriction by allowing 
a to be an ML type scheme. This gives a useful increase in typing power — 
it can type some natural programs that cannot be typed by the simple 
recursion rule. However, polymorphic recursion makes type inference unde- 
cidable [14]. 

We suggest another way of typing recursive definitions: 

AU {x : t] h M : a , , , 

(wfiere a < t) 



A h (jixM) : a 



The rule says that as long as the type a of M is more general than the 
assumption r on x needed to type M, we can deduce a as the type of the 
recursive definition. 



The equivalence between the rank 2 fragments of System F and the intersection type 
discipline has been shown independently by Yokouchi [35]. 



We extend P2 to type recursive definitions in this way. The resulting 
system can type many (but not all) of the examples that seem to require 
polymorphic recursion. Moreover, the system has principal types and de- 
cidable type inference. 

Organization of the paper 

In §2, we introduce I| , a syntax-directed version of I2, and A| , a syntax- 
directed version of A2. The main result is that a term is typable in one 
system if and only if it is typable in the other. An immediate corollary is 
that typability in I| is DEXPTIME-complete, the same complexity as in 
ML and A|. In §3, we present the type inference algorithm for I|. In §4, we 
discuss some other definitions of rank 2 intersection type systems, and show 
their equivalence with I2. In §5, we define P2, show that it has principal 
typings, and give a type inference algorithm. In §6, we discuss various ways 
of typing recursive definitions, and we propose an extension of P2 that can 
type many examples of polymorphic recursion. We discuss applications of 
P2 to compilation in §7, and we summarize our results in §8. 

2 Rank 2 type systems 

2.1 Preliminaries 

We will be defining a number of type systems; here we develop machinery 
that will be useful in all of them. 

We use x,y, . . . to range over a countable set of variables, and t, s to 
range over a countable set, Tv, of type variables. The terms and types of 
the systems will vary, but in all cases we use <7, r, . . . to range over types, 
and M, A, P, ... to range over terms. 

The terms of the (pure) lambda calculus are defined by the following 
grammar: 

M ::= x\ (M 1 M 2 ) | (XxM). 

Unless stated otherwise, terms are considered syntactically equal modulo 
renaming of bound variables. We adopt the usual conventions that allow us 
to omit parentheses: application associates to the left, and the scope of an 
abstraction 'A' extends to the right as far as possible. We write Xxi ■ ■ -x n .M 

for (Aa;i(---(Aa;„M) •••))• 

The types of our systems will all be subsets of the types with quantifi- 
cation and intersection: 

a ::= t | (a-y — ► a 2 ) | (Vicr) | (a-y A a 2 ). 



By convention, '— ►' associates to the right, so that, e.g., (/—►(£—► t)) may 
be written more compactly as t — ► t —^ t, and 'A' binds more tightly than 
'— ►', e.g., a A t — ► t means (a At) — ► t. The scope of a quantifier 'V extends 
as far to the right as possible. We write (Vic) for the type 

(Vii(Vi 2 (...(Vi„a)...))), 

where t = t\,t2, ■ ■ ■ ,t n and n > 0. 

The set of simple types, To, is defined by the following inductive equa- 
tion: 

To = { t | t is a type variable } U { (a — ► r) | <7, r £ To }. 

A type environment is a finite set {x\ : <7i , . . . , x n : a n } of (variable, type) 
pairs, where the variables x\, . . . , x n are distinct. We use A to range over 
type environments. We write A(x) for the type paired with x in A, dom(A) 
for the set {x \ 3t.(x : r) £ A}, and A^ for the type environment A with 
any pair for the variable x removed. We write A\ U Ai for the union of two 
type environments; by convention we assume that dom(Ai) and dom(A2) 
are disjoint. For any set T of types, we say A is a T type environment if 
A(x) e T for all x £ dom(A). 

The notion of free type variable is defined as usual. We write FTV(ct) for 
the free type variables of a type a, and FTV(A) for the free type variables 
of all types appearing in A. We write Gen(A, r) for the V-closure of r by 
the type variables FTV(r) - FTV(A). 

A judgment is a relation between type environments, terms, and types, 
written A h M : a. A term M is typable if A h M : <7 for some A and <7. 
A pair {A, a) of a type environment and a type is called simply a pair. 
Two pairs (Ai,ci) and (^2,02) are disjoint if their free type variables are 
disjoint. An acceptable pair of a term M in a type system is a pair (A, ct) 
such that the judgment A \- M : a holds in the type system. We write 
AP-p(M) for the set of acceptable pairs of M in a type system X>. 

A substitution is a mapping from type variables to simple types which is 
the identity on all but a finite number of type variables. We use S,R,Q,U 
to range over substitutions. The domain and range of a substitution S are 
defined 

dom(S) = {t\St^ t}, 
rng(S) = (J FTV(St). 

tedom(S) 



If dom(5') = {t\,t2, ■ ■ -,t n } and Sti = T{ for all i, then S can be written in 
the form {h := Ti,. . .,t n := r n }. 

The application of substitutions is extended to types, type environments, 
and pairs in the usual way. The composition of substitutions is denoted 
by juxtaposition, so that SRt = (SR)t = S(R(t)). We say S\ and 5*2 are 
disjoint if dom(5'i) and dom(5 , 2) are disjoint sets. If S\ and 5*2 are disjoint, 
then the substitution S\ U 5*2 is defined as follows: 



(Si u s 2 )(t) 



S^t) inedom(Si) 
S 2 (t) iftedom(S 2 ) 
t otherwise. 



Note that we have made a severe restriction on substitutions: they map 
type variables only to simple types, and not types in general. 

2.2 The rank 2 intersection type system 

There are many different formulations of intersection type systems; see van 
Bakel [33] for a survey. We will present a very restricted intersection type 
system here, the system of rank 2 intersection types. Our system is a slight 
generalization of van Bakel's version (see §4.1). 

The terms of the intersection type system are just the terms of the 
lambda calculus. The sets Ti and T2 are defined to be the smallest sets 
satisfying the following equations: 

Ti = T U{(ciAr)|c7,rGT 1 }, 

T 2 = T U {(a -+ t) I a G T x , r G T 2 }. 

The set Ti of rank 1 types consists of finite, nonempty intersections of simple 
types. T2 is the set of rank 2 intersection types: these are types possibly 
containing intersections, but only to the left of a single arrow. Note that 
T = Ti n T 2 , and for i G {0, 1, 2}, if r G T 8 , then St G T 8 . 

In order to simplify subsequent definitions, we adopt the following syn- 
tactic convention: we consider 'A' to be an associative, commutative, and 
idempotent operator, so that any Ti type may be considered a finite, non- 
empty set of simple types, written in the form (A«e/ a i)i where each <7 8 - G To- 

Definition 1 For i G {1,2}, we define the relation < 8 - as the least partial 
order on T 8 - closed under the following rules: 

i) If {tj I j G J} C {<n |ie/}, then (A,- e/ at) <i (A jeJ tj). 



(var) A x U {x : (/\ ieI Ti)} h a; : r 8o (where i G i") 

A X U {x : a}\- M :t 



[ABS) A\-(XxM):<t^t 

A\-M:(A i€l Tj)^a, (Vi € J) A h iV : r,- 
1 J Ah(MiV):(7 

Figure 3: Typing rules of I|. Types in type environments are in Ti, and 
derived types are in T2. 



ii) If a-y <i T-y and r 2 < 2 cr 2 , then (ri — ► r 2 ) < 2 (cti — ► cr 2 ). 

The first rule says that <i expresses the natural ordering on intersection 
types, and the second rule says that < 2 obeys the usual antimonotonic 
ordering on function types, restricted to rank 2. 

Some useful properties of the orderings <i and < 2 are summarized in 
the following lemma. 

Lemma 2 

i) Ifae T and t G Ti, then a <y t iff a = t. 

ii) If a G T 2 and r G To, then a < 2 r iff a = t. 

iii) (J\i e j(Ji) <i (J\j e jTj) iff for all j G J there exists an i £ I such that 

Tj = (Ti. 

i y ) (J\jeJ T i) ~^ T —2 (J\iel (J i) ~^ (T iff t < 2 a, and for all j £ J there 
exists an i £ I such that Tj = 0{. 

v) For i G {1,2}, if a < 8 - r, then Sa < 8 - St. 

Judgments in our rank 2 system are defined inductively by the rules of 
Figure 3. We write I| > A h M : a if the judgment A \- M : a follows by 
these rules, with types appearing in type environments restricted to Ti, and 
derived types restricted to T 2 . The superscript 's' in I| indicates that the 
system is syntax-directed, in contrast with a later variant (see §4). 

If Ay and A 2 are Ti type environments, we define Ay + A 2 , a Ti type 
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environment, as follows: for each x £ dom(Ai) U dom(A2), 
(A 1 + A 2 )(x) = 



Ai(x) if x £" dom(A 2 ), 

A 2 (x) if x £" dom(Ai), 

Ai(a;) A ^2(2) otherwise. 



Lemma 3 (Weakening) // 1| > A h M : o, then I s 2 > A + A' \- M : o for 
any Ti £?/pe environment A' . 

Proof: An easy induction on typing derivations. □ 

Lemma 4 (Substitutivity) Ifl\>A\-M:a, then l%> SA*r M : So for 
any substitution S . 

Proof: By induction on the structure of M. 

i) If M = x, then A(x) = (J\ ieI Oi) and o = Oi for some %q £ /. Then 
SA(x) = (f\i e j Sai), I| > SA h x : Sai , and So = Soi . 

ii) If M = XxN then o must be of the form T\ —^ t 2 , and I| > A x U {x : 
Ti} h iV : T2. Then by induction, I| > S(A X U {x : Ti}) h iV : 6Y2, so by 
rule (abs), F 2 > SA X \- N : Sn ^ St 2 , or I| > SA X h JV : S(n -► r 2 ). 
Then by weakening, I| > 5 A h N : 5(ti — ► T2). 

iii) If M = MiM 2 , then for some (f\ ieI Ti) £ T x we have I| > A h M x : 
(AieJ r «) ~^ ^ an( i ^-2 > A h M2 : t 8 - for all i £ I. By induction we 
have I| > 5A h Mj : (/\,- e/ St,) -► So and I| > SA h M 2 : SV,-, and by 
rule (app), we have I| > SA h M\M 2 : So, as desired. 

□ 



2.3 System F 

The terms of System F are exactly the terms of the lambda calculus. The 
types of System F are defined by the following grammar: 

t ::= t\ (n -+ t 2 ) I (Vir). 

We consider System F types to be syntactically equal modulo renaming of 
bound type variables, reordering of adjacent quantifiers, and elimination of 
unnecessary quantifiers. 
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The types of System F can be organized into a hierarchy as follows. 
First, define R(0) = To- Then for n > 0, the set R(n + 1) is defined to be 
the least set satisfying 

R(n+1) = R(ra)U{(o- -► r) | a G R(n),r G R(ra + 1)} 
U{(Vto) I ct G R(ra + 1)}. 

It will be useful to restrict types so that quantifiers do not appear to the 
immediate right of arrows. Therefore we define the sets 

S = S' U {(Vta) | a G S}, 
S' = T U{((i^r) | ae S,r G S')}. 

We write S(ra) for S n R(ra) and S'(ra) for S' l~l R(ra). Note that the S(l) 
types are exactly the ML type schemes. 

Definition 5 Suppose a = Vii • • -t n .T G S(l), and t,t' G To- We say t' is 
an instance of a, written a y r', if and only if for some pi, . . . , p n G To, we 
have t' = {t\ := pi, . . . ,t n := p n } T - We write a y (Vsi • • -s m r') if and only 
if si, . . . , s m are not free in a and ct >- r'. 

Note that the sense of V is opposite to that of our other subtyping relations: 
both "it <2 t" and "<7 >- r" may be read, "ct is more general than r." We 
make an exception in the case of V to be consistent with its use in ML [24]. 

We now define A|, our version of the rank 2 fragment of System F. The 
superscript 's' in A| indicates that the system is syntax-directed. See Kfoury 
and Tiuryn [12] for a definition of A2, the non-syntax-directed version. 

The judgments of the system are defined by the rules of Figure 4. We 
write A|>AhM:rifAhM:ris derivable from these rules, where types 
in type environments are restricted to S(l), and derived types are restricted 
to S'(2). 

A| is closely related to the system A^ studied by Kfoury et al. [12, 17]: 

Theorem 6 

i) If A| > Ah M :a, then A^ > A h M : a. 

ii) If A^ > A h M : a, then a is of the form Vii • • -t n a', where a' G S'(2), 
and A| > A h M : a'. 

This equivalence follows immediately from results of Kfoury and Wells [17]. 
It implies the following useful result: 

Lemma 7 If A| > A h M : a and Gen(A, a) y a', then A| > A h M : a'. 

11 



(var) A x U {x : a} h x : r (where a y r) 

A x U {x : ri} h M : r 2 



ABS 



A h (AzM) : ri -► r 2 



|App) .HM:,^)-,,^^ (e a dl ,,FTV<,„ 



Figure 4: Typing rules of A|. Types in type environments are in S(l), and 
derived types are in S'(2). 



2.4 ML 

Many different formulations of the ML type system have been studied; we 
choose to present a syntax-directed version here, as in Clement et al. [4] or 
Tofte [32]. 

The types of ML are the types To, and the ML type schemes are the 
types S(l). The terms of ML are the terms of the lambda calculus extended 
with let-expressions: 

M ::= x\ {M 1 M 2 ) \ (XxM) | (let x = M x in M 2 ). 

The judgments of ML are defined inductively by the rules of Figure 5. We 
write ML i>AhM:rifihM:ris derivable from these rules, where types 
in type environments are restricted to S(l), and derived types are restricted 
to T . 

Definition 8 An ML type r is a principal type for M in A if and only 
if ML > A h M : r, and for all ML types r', if ML > A h M : r', then 
Gen(A,r) y t' . 

Theorem 9 (Principal types for ML) If M is typable by A, then there 
exists a principal type for M in A. 

Lemma 10 If ML > A h M : t, and Gen(A, r) y t' , then ML > A h M : r'. 

2.5 Relationship of A2 and I2 

We now show that a term is typable in A| if and only if it is typable in I|. 
The left to right implication is developed first. 
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(var) A x U {x : a} h x : r (where a y r) 

4 h M : ti ^ r 2 , A h N : r x 



APP 



ABS 



LET 



A h (MN) : r 2 

4 U {i : rj h M : r 2 
A h (AaiM) : ri -► r 2 

A h Mi : ri, A^ U {a; : Gen(A, r x )} h M 2 : r 2 
Ah (let a; = Mi in M 2 ) : r 2 



Figure 5: Typing rules of ML. Types in type environments are in S(l), and 
derived types are in To- 

Definition 11 

i) We define a relation ^i between S(l) and Ti as follows. Suppose 
t G S(l) and &!,..., cr n £ To (n > f). Then r ^i (AieJ (7 «) if an( i 
only if r >- <7 8 - for all i G /. 

ii) We define the relation <2 between S'(2) and T 2 inductively: 

a) For any type variable t, t <2 t. 

b) If t ^i t' and ct < 2 °' , then (r -► a) < 2 (t' -+ a'). 

Note that the relation ^ 2 is monotonic in the argument of function types, in 
contrast to the relation < 2 . We extend the relation ^i to type environments 
as follows: A ^i A' if and only if a: 6 dom(A) and A(x) <\ A'(ai) whenever 
x £ dom(A'). 

Theorem 12 If A s 2 > A h M : t, then I| > A' h M : t' , where A ^ A' and 

T < 2 T 1 . 

Proof: By induction on derivations. 

i) M = x and A| > A h x : r follows by the A| rule (var). Then we 
must have A(x) y t. 

Let A' = {x : r}. Clearly I| > A' h M : r, A ^ x A', and r ^ 2 r. 

ii) M = XxN, t = a — ► ri, and A| > A h AaiA : cr — ► r x follows by the A| 
rule (abs). 
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Then we must have 

A s 2 > Au{x :a} h iV : r x . 
By induction, we have 

I|> A'u{x :a'} h iV : t[, 

where A ^i A', a ^i a', and ri <2 T {- So by the I| rule (abs), we 
have 

F 2 > A' h N : a' -> r l7 

where A ^i A', and (<7 — ► ri) ^2 (o 7 - ► t{), as desired. 

iii) M = M X M 2 and A| > A h MiM 2 : r follows by the A| rule (app). 
Then we must have, for some To G To, 

A| > A h Mi : (V£r ) -► r, 

A| > A h M 2 : r , 

where the type variables t do not appear in FTV(A). Then by induc- 
tion we have 

r 2 >A hM i: (/\r 8 )^r', 
iei 

where A ^1 A , r ^ 2 r', and (Vf.r ) ^1 (Aie/ r i)- 

Then each r 8 - is an instance of (Vi.ro), and therefore by Lemma 7, 

A| > A h M 2 : r t - for all i G i". 

By induction we have for all i G I, I 2 > A' h M 2 : r 8 -, where A ^1 A'-. 
So if A' = A + Sj- e /A'-, then A ^1 A', and by weakening, 

F 2 >A'hM i: (A 8e/ r 8 )^r', 

I| > A' h M 2 : r t - (Vi G I). 

Then by the I| rule (app) we have 

F 2 > A' h M 1 M 2 : t', 

as desired. 



□ 



We now show the other direction of the equivalence: any term typable 
in I| is typable in A|. 
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Convention 13 In the remainder of this section we do not consider terms 
to be identical modulo a- conversion, and we will assume the following con- 
vention regarding the names of bound and free variables: 

i) No variable is bound more than once. 

ii) The bound and free variables are disjoint. 

This convention is necessary to make the following function well-defined: 

Definition 14 Let e denote the empty sequence. The function, act, that 
maps terms to sequences of variables, is defined inductively by the following 
rules. 3 

i) act (a;) = e. 

ii) If act(M) = x\, . . . , x n then act(XyM) = y, x\, . . . , x n . 

iii) If act(M) = y, x\, . . . , x n (n > 0) then act(MN) = x\, . . . , x n . 

iv) If act(M) = e then act(MJV) = e. 

Definition 15 

i) 7 is the rule 

(Xx(XyM))N -> Xy((XxM)N). 



i 



is the compatible closure of 7. 



iii) A 7-redex is any term matching the left-hand side of the rule 7. We 
say M is a 7-normal form, or 7-nf, if no subterm of M is a 7-redex. 

Note that by our convention on the distinct naming of variables, there is no 
capture of variables in the 7 rule. We use the name "7" in accordance with 
Kfoury and Wells [18]. See Barendregt [2] for a definition of "compatible." 

Lemma 16 

i) -^ 1 is strongly normalizing. 

ii) -^ 1 satisfies the diamond property. 
iii) 7-nf's are unique. 



Our definition is identical to the definition of [12], but differs from [11]. 
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Proof: 

i) The proof is similar to the proof of Lemma 5.5 from Kfoury and 
Wells [17]: 

Let appl(M) be the set of subterms of M that are applications, and 
let 

6(M) = J2 max(0, |act(Mi)| - 1). 

(MiM 2 )eappl(M) 

If M -> 7 N, then 6(M) = 6(N) + 1. Since for any M we have 6(M) > 
0, we can conclude that — >- 7 is strongly normalizing. In fact, S(M) > 
iff M contains a 7-redex, and M normalizes in exactly S(M) steps. 

If \M\ is the size (number of subterms) of M, then clearly |appl(M)| < 
\M\ and |act(M)| < \M\. Thus 6(M) < \M\ 2 . Therefore normaliza- 
tion of a term M takes 0(|M| 2 ) steps. 

ii) This is a simple case analysis. 

iii) This follows from (ii). 

□ 

Lemma 16 justifies the following definition: 

Definition 17 We write -y-ni(M) for the 7-nf of M. 

Lemma 18 For V £ {I|, A|}, the following hold: 

i) V> AY- (Xx(XyM))N : a iff V> AY- Xy((XxM)N) : a. 
n)IfM-y 1 N,thenV>A\-M:a iff V>AY~N:a. 
iii) V> AY- M :a iff V> AY- 7 -nf(M) : a. 
Proof: 

i) Simple case analysis. 

ii) Use (i) and induction on the definition of compatible, 
iii) Use (ii) and induction on the length of rewriting. 
□ 
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Lemma 19 If act (M) = x\, . . . , x n and I| > A h M : a, then a is of the 

form <7i — ► • • • — ► a n —^ t, where t £ To- 

Proof: By induction on the structure of M. 

i) If M = x, then n = by the definition of act, and a £ To by rule 

(var). 

ii) If M = XxiN , then I| > A h M : a follows by rule (abs), and therefore 
a is of the form a\ —^ a', where a\ £ Ti. 

Also we must have act(iV) = X2, . . .,x n (n > 1) and I| > A U {x\ : 
a\} \- N : a' . By induction a' must be of the form 02 —^ ■ ■ ■ —^ o n —^ t, 
where 02, . . . , o n £ Ti and r £ To- 

iii) If M = M 1 M 2 , then I| > A h M : a follows by rule (app), and 
therefore we have I| > A h M\ : a' —^ a, where a' £ Ti. 

We consider two cases. If act (Mi) = y,x\, . . . , x n for some variable 
y, then by induction, a is of the form a\ —^ ■ ■ ■ -^ a n —^ r, where 
ai,. . .,a n £ Ti and r £ T . 

Otherwise act (Mi) = e, and therefore act(M) = e, so we only need 
prove it £ To- And by induction, we have (a' — ► ct) £ To, so a £ To- 

D 

Note 20 A similar lemma holds for A|, c.f. Kfoury et al. [12], Lemma 15. 

Lemma 21 Suppose M is a j-nf. Then 

act(M) ^ e iff M = XyN for some y, N. 

Proof: By induction on the structure of M. The cases M = x and M = 
XyN are trivial, so assume M = M\M.2- We must show act(M) = e. 

By way of contradiction, assume that act(M) = x\, . . .,x n (n > 1). By 
the definition of act, we must have act (Mi) = y, x\, . . . , x n for some y. 
Then act (Mi) 7^ e, so by induction we have Mi = XyM{, and act(M{) = 
x\, . . . , x n . Since ra > 1, act(M{) 7^ e, and by induction M[ = \x\M". But 
then M is a 7-redex, contradiction. □ 

Definition 22 We define a mapping, ml, from terms to ML terms: 
i) ml(a;) = x. 
ii) m\{XxM) = (Xx ml(M)). 
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iirt mUMM^-l (let ^ = ml(M 2 ) in ml(iV)) HM 1 = \xN, 
m) ml(M 1 M 2 ) - | (ml(Mi) ml(M2)) otherwise. 

Definition 23 

i) A generalization of a set T of simple types is a type a G S(l) such that 
a y t for every r G T. A generalization a of T is the least common 
generalization of T if a' y a for any other generalization a' of T. 

ii) If (Aie/ r «) £ Ti, we define lcg(/\ 8e jr 8 ) to be the least common gen- 
eralization of {r 8 - | i G I}. If (Ti, . . . , (T n G Ti and r G To, then 

lcgOi -► ► CT n -► r) = lcg(o-i) -► ► lcg(CT n ) -► r. 

The function leg is extended to type environments in the usual way. 

The use of "least" in the name "least common generalization" is consistent 
with the relation l y\ Recall that the sense of V is opposite to that of our 
other subtyping relations, so that "least" for V means "greatest" for the 
other relations. 

The concept of least common generalizations was developed by Plotkin 
[26] and Reynolds [27]. They showed that any finite nonempty set of simple 
types has a least common generalization, and they gave an algorithm to 
compute it. 

Lemma 24 If M is a j-nf and a G To, then 

i) I| > A h M : a implies ML > lcg(A) h ml(M) : a; and 
ii) A| > A h M : a if and only if ML > A h ml(M) : a. 

Proof: 

i) By induction on the structure of M. 

a) The case M = x is trivial. 

b) If M = XyN, then I| > A h M : a follows by the I| rule (abs), so 
a must be of the form r — ► a' where r, a' G To, and I| > A U {y : 
r} h N : a' . Note that JV is a 7-nf, so we can apply the induction 
hypothesis to get 

ML > lcg(A U {y : r}) h ml(iV) : a'. 

Now r G T , so lcg(A U {y : r}) = lcg(A) U {y : r}. Therefore 
ML > lcg(A) U {y : r} h ml(iV) : a', so by the ML rule (abs), 
ML > lcg(A) \~ ml(XyN) : r -^ a', as desired. 
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c) If M = (AyMi)M 2 , then our judgment must follow by I| rules 
(abs) and (app). Thus we have 

Il>AU{y.(/\ i€l (n)}\-M 1 :(T, 

(Vi el) I s 2 > A h M 2 : a % . 

Let \/tr = \cg(/\ ieI <Ji), where r £ To, and no t{ appears in A. 
By induction, we have 

ML > lcg(A) U {y : Vfr} h ml(Mi) : a, 
(Vi G /) ML > lcg(A) h ml(M 2 ) : ct,-. 

By the principal type property of ML, we have 

ML > lcg(A) h ml(M 2 ) : r. 
Then since ml(M) = (let y = ml(M2) in ml(Mi)), we have 

ML > lcg(A) h ml(M) : a 

by the ML rule (let). 

d) If M = M1M2, where Mi is not an abstraction, then by the I| 
rule (app), we have for some a', 

I| > A h Mi : ct' -► d, 
I| > A h M 2 : ct'. 

Mi is a 7-nf and is not an abstraction, so by Lemma 21, we have 
act (Mi) = e. Then by Lemma 19, o' —* o e To, and therefore 
a' e To- M 2 is also a 7-nf, so we may apply the induction 
hypothesis to both judgments above, to get 

ML > lcg(A) V ml(Mi) : a' -> a, 
ML > lcg(A) h ml(M 2 ) : a'. 

Then by the ML rule (app), we have 

ML > lcg(A) h ml(MiM 2 ) : a, 

as desired. 

ii) Similar, but easier. 

□ 
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Note 25 The converse of Lemma 24(i) does not hold. For instance, if a = 
ts and A = {x : ti A ^2}? then lcg(A) = {x : \/t.t}, m\(xx) = xx, and 
ML > {x : \/t.t} h xx : £3, but the judgment {a; : ii A ^2} l~ ^^ : ^3 cannot be 
derived in I|. 

Theorem 26 IfI s 2 >A\-M:cr, then A| > lcg(A) h M : lcg(cr). 

Proof: Suppose act(M) = x\, . . .,x n . Then by Lemma 19, a is of the form 
oi —►•••—► an —► r, where r £ To, and by Lemma 21, the 7-nf of M is of 
the form \x\ ■ ■ -Xx n N, where N is a 7-nf. By Lemma 18(iii), 

I2 > A h Xxi ■ ■ -Xx n N : a. 

This judgment must follow by n uses of the I| rule (abs), so we have 

I2 > A U {^i : (Ti, . . . , x n : (T n } \- N : t. 

Then by Lemma 24, we have 

A s 2 > lcg(A U{x 1 :<r 1 ,...,x n : a n }) h iV : r. 

By n uses of the A| rule (abs), we have 

A2 > lcg(A) h Aa>i • • • As n A : lcg(a), 

and by Lemma 18(iii), we have 

A s 2 > lcg(A) h M : lcg(cr). 

D 

Theorem 21 If M is a term of the pure lambda calculus, then M is typable 
in I| if and only if M is typable in A|. 

Therefore, typability in I| is DEXPTIME-complete. 

Proof: The equivalence of I| and A| typability follows from Theorems 12 
and 26. 

Kfoury and Tiuryn [12] show that A| typability is polynomial time 
equivalent to ML typability. ML typability was shown to be DEXPTIME- 
complete independently by Kfoury et al. [15] and by Mairson [22]. □ 

This equivalence has been shown independently by Yokouchi [35]. 
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3 Type inference for I| 

We present the type inference algorithm for I| and a proof that it infers prin- 
cipal pairs. The algorithm is not new: it was described briefly in Leivant's 
original paper [21], and was defined rigorously by van Bakel in his disser- 
tation [33]. We include it here because the algorithm provides a way to 
compare a variety of type systems based on rank 2 intersection types. 

The algorithm takes as input a term M , and produces a pair (A, a) such 
that I| > A \- M : a. Moreover, the pair {A, a) is principal in the sense 
that any other acceptable pair of M can be obtained from (A, a) by some 
well-understood operations. 

Definition 28 

i) We write A <i A 1 if x G dom(A) and A(x) <i A'{x) for all x G 
dom(A'). 

ii) The ordering < on (Ti type environment, T2 type) pairs is defined as 
follows: 

{A, o-) < (-4', o-') if and only if A' < x A and a < 2 a'. 

iii) A pair {A, a) is a principal pair for M if {A, a) G APp(M), and for 
any other pair (A' , a') G APp (M), there is a substitution S such that 

S(A,a)<(A',a'). 

Note that <i and < are transitive, and A + A' <\ A for all Ti type envi- 
ronments A, A'. 

3.1 Subtype satisfaction 

In this section we give a decision procedure for one of our subtyping relations, 
and show how to solve a more general problem, subtype satisfaction, that 
we use in our type inference algorithm. 

Up until now, we have relied on some syntactic conventions to simplify 
our presentation, namely, that 'A' is an associative, commutative, and idem- 
potent operator. Part of the problem we are addressing here is how to decide 
whether two types are equivalent under these assumptions. Therefore, in this 
section, we do not rely on the syntactic conventions in any way. 

Subtype satisfaction is a generalization of the well-known problem of 
unification, and the techniques we use here are based on those used to solve 
unification. For more details, consult a survey on unification [19, 20, 30, 10, 
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6, 31, 1]. One difference between unification and our satisfaction probiems is 
that we work with types that go beyond simpie types, but our substitutions 
invoive oniy simpie types. This is not the typicai case with unification, and 
it makes our probiem easier to soive. 

If Si, 5*2 are substitutions and V is a set of type variables, we say Si 
and 5*2 are equivalent on V, written Si =y S2, if S\t = 62 i for every t G V . 
We say Si is more general than S2 on V , written Si <y S2, if there is a 
substitution S3 such that S2 =v S3S1. The relation <y is a partial order 
modulo =y. We omit V when V = TV. A substitution S is idempotent if 
S = SS, or, equivalently, if dom(5') fl rng(5) = 0. 

We define the relation <2,i between T2 and Ti as the least relation 
closed under the rule: 

• If cr <2 Ti for all i £ I, then a <2,i (AieJ 7 *)- 

A <2,i- satisfaction problem is a pair 3s. P, where P is a finite set whose every 
element is either: 1) an equality between simple types; or 2) an inequality 
between a T2 type and a Ti type. When s is empty 3s may be omitted. 
We use 7r to range over <2,i-satisfaction problems. 

A substitution S is a solution to 3s. P if there is a substitution S' such 
that S(t) = S'(t) for all t G - s, S'a <2,i S't for all inequalities (a < r) G P, 
and 5" cr = 5"r for all equalities (a = r) £ P. The (possibly empty) set of 
solutions to a problem 7r is written Solutions(7r). Two problems tti and ^2 
are equivalent if Solutions (7Ti) = Solutions(7T2). 

Definition 29 

i) A substitution U is a mosi general solution to it if it satisfies the 
following conditions. 

a) U G Solutions(7r). 

b) If 5 G Solutions(7r) then U <ftv(tt) S. 

c) [7 is idempotent. 

d) dom(U) C FTV(vr). 

ii) We write MGS(7r) for the (possibly empty) set of most general solu- 
tions to a <2,i-satisfaction problem it. 

We require the last two conditions on most general solutions for technical 
convenience only. We could relax the definition by eliminating those condi- 
tions; but any it has a solution under the relaxed definition if and only if it 
has a solution under our definition. 
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Sometimes it is useful to ensure that a most general solution does not 
interfere with a set of "protected" variables. For any set W of type variables, 
we say U is a most general solution to it away from W if U G MGS(7r) and 
W fl mg(U) = 0, and we write MGS(7r)[T / F] for the (possibly empty) set of 
most general solutions to it away from W. 

Lemma 30 IfU G MGS(7r)[T / F] and S G Solutions(7r), thenU <vfuFTV(tt) 
S. 

Proof: Since U <ftv(tt) S, there is some R such that RU =ftv(tt) S. 
Define 

Rl(t = | R(t) iftemg(U), 
1 S(t) otherwise. 

If t G FTV(vr), then R'(U(tj) = R(U(tj) = S(t). And if t G W - FTV(vr), 
then t g (dom(7r) U rng(Tr)), so R'{U{t)) = R\t) = S(t). □ 

A unification problem is a subtype satisfaction problem involving only 
equalities. Algorithms for solving unification problems are well known; in 
particular, we have the following theorem. 

Theorem 31 Let it be a unification problem and W be a finite set of type 
variables. 

i) Solutions(Tr) = iff MGS(vr) = iff MGS(ir)[W] = 0. 

ii) There is an algorithm that decides whether it has a solution, and, if 
so, returns an element of MGS(7r)[T / F]. 

Proof: See for example Snyder [31], Lemma 3.3.11. □ 

Theorem 32 Every ^^-satisfaction problem is equivalent to a unification 
problem, and moreover, there is an algorithm that transforms every <2,i- 
satisfaction problem into an equivalent unification problem. 

Corollary 33 Let it be a ^^-satisfaction problem and W be a finite set of 
type variables. 

i) Solutions(Tr) = iff MGS(vr) = iff MGS(ir)[W] = 0. 

ii) There is an algorithm that decides whether it has a solution, and, if 
so, returns an element of MGS(7r)[T / F]. 
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(cti — ► a 2 ) < t =>- 3t 1 ,t 2 .{ti < ai,(T2 < h,t = ti — ► ^2} 

if t\ , t 2 are fresh 

Oi -r o 2 ) < (n -> t 2 ) => {n < cti, a 2 < t 2 } 

a < (ri A r 2 ) =^ {o" < ri, cr < r 2 } 

i < t =>• {i = r} 

if r is a simple type 

Figure 6: Transformational rules for < 2j i-satisfaction problems 



We will prove Theorem 32 by giving an algorithm that transforms any 
< 2j i-satisfaction problem into an equivalent unification problem. Corol- 
lary 33 follows by combining the transformation with any unification algo- 
rithm. 

Our transformation is defined by rules of the form 

a <t =>• 3t.P. 

The rules may need to introduce fresh type variables, that is, type variables 
that do not appear on the left-hand side. These variables will appear in 
the variables t of the right-hand side (but they are not the only source of 
variables in t). 

The rules are used to define a rewrite relation on problems: 

a < t =>• 3t.P 



3s.P' 1+1 {a < t} => 3s\St.P'UP 

The operator '!+)' is disjoint union; on the right of the consequent, it means 
that the variables t must be fresh (this can always be achieved by renaming). 
The rules for transforming a < 2j i-satisfaction problem into a unification 
problem are given in Figure 6. 

Proof of Theorem 32: We show that the rules of Figure 6 constitute an 
algorithm for converting any < 2j i-satisfaction problem into an equivalent 
unification problem. 

First, note that every rule transforms a < 2j i-satisfaction problem into 
another < 2j i-satisfaction problem (equalities are between simple types, in- 
equalities are between T 2 and Ti types). 
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Second, note that each rule preserves the set of solutions, so that each 
application of a rule transforms a problem into an equivalent problem. 

Third, note that repeated application of these rules must halt: every 
rule reduces the number of type constructors ('— ►' or 'A') in inequalities or 
reduces the number of inequalities. 

Finally, note that a normal form contains no inequalities, and is therefore 
a unification problem. □ 

Theorem 34 The subtyping relation <2,i is decidable. 

Proof: To see whether a <2,i r, compute U G MGS({<7 < r}) and check 
to see whether U is the identity substitution. □ 

Decision procedures for the other subtyping relations can be obtained in 
a similar way. 

Because we so often want to ensure that U G MGS(7r) is chosen "away" 
from a set of type variables, we adopt the following convention. 

Convention 35 Whenever U G MGS(7r) occurs in any mathematical con- 
text, we assume that U is chosen so that it does not interfere with "current" 
type variables, that is, U G MGS(tt)[VT] where W U FTV(vr) is the set of 
type variables present in the context. 

3.2 Type inference 

Definition 36 For any term M, we define the set PPp (M) of pairs by 
induction: 

i) If M = x, then for any type variable t, ({x : t},t) G PPp (x). 

ii) If M = XxN, and {A, a) G PPib (JV), then: 

a) If x G - dom(A), and t is a type variable not appearing in (A, a), 
then (A,t^ a) G PPi^XxN). 

b) If x G dom(A), then (A x , A(x) -? a) G PPi| (XxN). 

iii) If M = MiM 2 , then: 

a) If (Ai,i) G PPi|(Mi) and (A 2 ,a 2 ) G PPi|(M 2 ) are disjoint, and 
U G MGS({i = t\ —^ t2,(J2 < t\}) where ti,t 2 are fresh, then 

^(A 1 +A 2 ,i 2 )GPPn(M 1 M 2 ). 
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b) If (A^ifa^ai) -+ CTl ) G PPi^Mx), and (A t ,r t ) G PPi*(M 2 ) for 
all i G I, where all pairs are chosen disjoint, and U G MGS({t; < 
Oi | i G /}), then 

t/(A x + E.-e/A,-,^) G PPjs (M X M 2 ). 

The following lemma establishes that the elements of PPp(M) are just 
trivial variants of each other. Therefore, the requirement of disjointness 
used in the definition of PPp is easily satisfied, and Definition 36 can be 
adapted to a type inference algorithm. 

Lemma 37 

i) If {A, a) G PPp(M), then x G dom(A) if and only if x is free in M. 

ii) Suppose (A 1 ,a 1 ) G PPi|(M). Tien (A 2 ,a 2 ) G PPi|(M) if and only if 
there is a bijection R of type variables such that R{Ai, a\) = (A 2 , 02)- 

Proof: An easy induction on Definition 36. □ 

Theorem 38 There is an algorithm that decides, for any M , whether the 
set PPp (M) is empty; and furthermore, if PPp (M) is not empty, it pro- 
duces a member o/PPp(M). 

Proof: Just follow the rules of Definition 36, generating "fresh" type vari- 
ables as necessary, and use the algorithm of Corollary 33 to compute MGS. 

□ 



Example 39 We show how the algorithm finds the type of (Xx.xx). 
i) PPp (2;) produces a pair ({x : ii},ii). 
ii) PPp (2;) (again) produces a pair ({x : ^j,^)- 
iii) To calculate PPp (xx), we find a most general solution to 

such as {^2 := ^3,^1 := ^3 — ► £4}- Then ({x : t% A (£3 — ► £4)}, £4) G 
iv) Finally, PPp (Xx. xx) produces (0,^3,^4.^3 A (t^ — ► t^) —^ t^). 

We now establish the soundness of PPp . 
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Theorem 40 If {A, a) G PPi»(M), then (A, a) G APis(M). 
Proof: By induction on the definition of PPp (M). 
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i) If M = x, then (A, a) = ({x : t},t), and we have (A, a) G APi|(a;) by 
rule (var). 

ii) If M = XxN , then by Lemma 37(i) we have the following two cases: 



a) x is not free in A, and a = t —^ a' ', where (A, a 1 ) G PPp (A). 
By induction and weakening, (Au{x : t},cr') G APp (A) (note 
that A U {a; : t} is well-formed by Lemma 37(i)). 

So by rule (abs), {A,t -+ a') = {A, a) G APi|(Aa:JV). 

b) x is free in A and (A, a) = (A' x ,A'(x) -> a'}, where (A', a'} G 
PPl*(A). 

By induction {A 1 , a') G APis(A), so {A, a) G APp^AsA) by 
rule (abs). 

iii) If M = M1M2, then one of the following cases holds: 

a) (A, a) = U(A 1 +A 2 ,t 2 ), where (A u t) G PPi|(Mi), (A 2 ,cr 2 ) G 
PPls(M 2 ), and U G MGS({* = ^ -+ i 2 ,cr 2 < ij). 
Phen by induction, weakening, and substitutivity, 



U(A 1 + A 2 ,t) G APis(M x ), 

C/(A 2 + A 2 ,ct 2 ) G APjs(M 2 ). 



Since Ua 2 < 2 Ut\, by Lemma 2(ii) we have Ua 2 = Ut\. And 
Ut = (Uh) -> (Ut 2 ), so by rule (app) we have U(A 1 + A 2 ,t 2 ) G 
APjs(M). 

b) (A, a) = tf(Ai + E,- e M,-,CTi), where (A t ,T t ) G PPi|(M 2 ) for all 
1 G /, (A u (h eI Oi) -+ ai) G PPi^Mx), and U G MGS({r t - < 

<Ti\ie /}). 

Phen by induction, weakening, and substitutivity, 



^(A 1 + E,- e/ A,-,(A,- e /^)^^i) e APis(M x ), 

^(Ai + E.-e/A,-,^) G APn(M 2 ) (Vi G /). 



□ 



By Lemma 2(ii) and the fact that JJt{ < 2 Udi, we have JJt{ = 
Uui. Phen by rule (app) we have U (A x + ^ ieI A l ,a 1 ) G AP P (M). 
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Theorem 41 (Principal pairs for I|) If (A, a) £ APp (M), then there is 
a pair (A', a') £ PPp (M) and a substitution S such that S(A',a') < (A, a). 

Proof: By cases on the structure of M. 

i) If M = a;, then {A, a) £ APi| (M) by rule (var), and therefore, A(x) = 
{J\i£l&i) and a = <7 8 - £ To for some %q £ /. 

For any t, ({x :t},t) £ PPp(M). Then {t := a} is a well-formed 
substitution and 

{t:=a}({x:t},t) = ({x:a},a)<(A,a). 

ii) If M = XxN, then by the definition of I|, o must be of the form 
°i —* < 7 2 5 an d (A x U {x : cti},^) £ APp(iV). By induction, there is a 
substitution S and pair (A',<7 2 ) £ PPp (A) such that 
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S{A\a' 2 )<{A x \j{x:a 1 },a 2 ). (1) 

We consider two cases. 

a) If x £" dom(A'), then for any fresh type variable t, (A', t — ► a 2 ) £ 
PPis^A). 

Note that a\ is of the form (A 8 'e/ ,T «)' anc ^ therefore, we can pick 
a[ £ To such that a\ <\ a[ (choose any a{). Then let S' = {t : = 
a[} U S. By (1) and the definition of <, 

S'(A',t^ a' 2 ) = {SA\a[ -+ Sa' 2 ) < {A x ,a 1 -+ a 2 ). 

Since A <i A x , we have S'(A',t — ► <7 2 ) < (A, <7i — ► <7 2 ), as de- 
sired. 

b) If x £ dom(A'), then (A^, A'(x) -> o' 2 ) £ PPp^AzA). Then 
by (1) and the definition of <, 

S(A' x ,A'(x) -+ o' 2 ) <{A x ,o x -> a 2 ), 

and since A <i A x , we have S(A' x ,A'(x) — ► <7 2 ) < (A, <7i — ► <7 2 ), 
as desired. 

iii) If M = MiM 2 , then by the definition of I|, {A, (f\ ieI (Ti) -+ a) £ 
APi|(Mi) and (A,cr 8 ) £ APis(M 2 ) for all i £ J. 

By induction, PPp (Mi) is nonempty, and by Lemma 37(ii), it is suf- 
ficient to consider the following cases on the structure of pairs in 
PPi^Mx). 
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a) {A\,t) G PPp (Mi). By induction, there is a substitution S\ such 
that 

S^A^t) < (A,(/\<Ti)^a). 
iei 

By the definition of < 2 , S\t = 0{ — ► a' for some i £ I and <r' G To- 
Then by induction and Lemma 37(ii), there is a disjoint pair 
(A 2 ,t) G PPp(M2) and substitution 5*2 such that 

S 2 (A 2 ,t) < (A,<Ti). 

Let 7r = {t = t\ —^ t 2 ,T < t\}, where ^1,^2 are fresh. Then 

S = Si U 5*2 U {ti := Cj-,i2 := a '} is a solution to 7r. 

Pick £/ G MGS(vr). Then U(A X + A 2 ,t 2 ) G PPi* {M 1 M 2 ). 

By Convention 35, there exists an R such that RU(Ai + A 2 ,t 2 ) = 

S(A 1 + A 2 ,t 2 ). And 

^(Ai + A 2 ,t 2 ) = (S 1 A 1 + S 2 A 2 ,a') < (A, a), 

as desired. 

b) (^(A^O-^ePPiitMi). 

By induction there is a substitution S\ such that 

je-J iei 

By the definition of < 2 , {S\a': \ j G J} C {<7 8 - | i G /}, so without 
loss of generality we assume J C I and S\a'- = Uj for all j G J. 
By induction and Lemma 37(ii), for all j £ J there are disjoint 
pairs (Aj,pj) G PPp(M2) and substitutions Sj such that 

Sj(Aj,pj} < (A,aj). 

Let 7r = {pj < CTj I j G J}. Then S = S\ U (UjeJ ^j) is a solution 
to 7r: Spj = ,5jPj <2 (Tj = S\cr'- = Sa'j. 
Pick U G MGS(vr). Then 

U(A t + V^jAj,*') G PPi | (M 1 M 2 ). 

By Convention 35, there exists an R such that 

RU(A! + E jeJ A,V> = S^ + X^jA^a'). 
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And 

S(A! + XjejAj,*') = (Si Ai + X^jSjA^Skt') < (A, a), 
as desired. 



□ 



4 Other systems of rank 2 intersection types 

4.1 A restriction of 1% 



L 2 



Van Bakel [33] denned a rank 2 intersection type system that is a slight 
restriction of our system I|. A version of his rules is presented below. 

(var) {x : t} \- x : t (where r G T ) 

A x U {x : Ti} h M : r 2 



ABS 



APP 



A^ h (AaiM) : ri -► r 2 

A h M : (A,- £ j r,-) -+ r, (Vi € J) A,- h iV : r,- 
A + S 8G/ A 8 h (MJV) : t 



We write 1^ > A h M : <7 if the judgment A \- M : a follows by these 
rules, under the following restrictions: environment types are in Ti; derived 
types are in T 2 ; and in every judgment Ah M : r, the type environment A 
contains only assumptions actually used in the derivation of A h M : r. 
For example, the rule (var) has been intentionally restricted to rule out a 
judgment such as 

{x : <7i A (72 } h a; : <7i, 

in which the type 02 assumed for a; is not used. Similarly, {x : a\,y : (T2} h 
a; : (Ti is not derivable because the assumption y : 02 is not used. The exact 
relation between DJ and I| is summed up in the following lemma. 

Lemma 42 (Comparison of T^ h and I|) 

i) If if" > A h M : a, then I| > A h M : o\ The converse does not hold. 

ii) A term M is typahle in I^ h if and only if it is typahle in I|. 

Proof: 

i) Just note that the T^ h rule (var) is a special case of the I| rule (var), 
that the I2 rule (abs) is identical to the I| rule (abs), and that the 
1^ rule (app) follows from the I| rule (abs) and weakening. 
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The examples above show that the converse does not hold. 

ii) This follows because the definition of principal pair in van Bakel's 
system is identical to our own. 



□ 



4.2 An extension of I2 

A natural extension of I| is obtained by adding the following rule to the 
rules of I| : 

( SUB ) AVM:a ( wherer ^2<r) 

We write I2 > A h M : a if the judgment A\- M : a follows by the rules of 
I| plus (sub), with types appearing in type environments restricted to Ti, 
and derived types restricted to T2. 

Clearly, every judgment of I| is a judgment of I2. The converse does not 
hold; for example, the judgment 

{x : a ^ t} \- x : (a A a') —^ t 

is derivable in 1 2 for any a ^ a 1 £ To, but is not derivable in I|. 

I2 has principal pairs, and indeed, they are identical to the principal 
pairs of I| (the proof is a simple extension of the proof of Theorem 41). An 
immediate consequence is that the terms typable in I2 are exactly the same 
as the terms typable in I|. 

In summary: 

Lemma 43 (Comparison of I2 and I|) 

i) If I| > A \- M : a, then I2 > A h M : a. The converse does not hold. 

ii) A term M is typable in 1 2 if and only if it is typable in I|. 

Although it does not type any more terms than I|, 1 2 has other advantages 
over I|. 

Example 44 The acceptable pairs of I| are not closed under the opera- 
tion <: 

I2 > {x : s — ► t} h x : s -^ t, 

and 

({x : s -+ t}, s -^t) < ({x : s -+ t}, (sAi)->t), 
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but the judgment 

{x : s — ► t} h a; : (s A t) —^ t 

is not derivable in I|. 

On the other hand, 1 2 is closed under <: 

Lemma 45 (Weakening for I 2 ) If I 2 > A \- M : a and {A, a) < {A 1 , a'}, 
then I 2 > A' h M : ct'. 

For this reason, we prefer 1 2 to either 1^ or I|. However, it was still useful 
to develop I|. In particular, the example above shows that Lemma 19 does 
not hold for I2; it was convenient to have Lemma 19 for the proof of the 
equivalence of typability with A|. 

5 Combining intersections and quantification 

5.1 The system P 2 

We now describe a type system that combines aspects of rank 2 intersection 
types and rank 2 polymorphic types. The system is called P2, as it is the 
rank 2 subset of a type system P (described elsewhere). 

The types of the system are the rank 2 intersection types extended with 
top-level quantifiers: 



-V2 



T 2 U {(Vta) I a £ T V2 }- 



In order to simplify the definition of subtyping, we consider T\/2 types 
syntactically equal modulo renaming of bound type variables, reordering of 
adjacent quantifiers, and elimination of unnecessary quantifiers. When a 
T\/2 type is written in the form Vsir, we assume a £ T2. 

Definition 46 

i) The relation <\/2 is the least partial order on T\/2 closed under the 
following rules: 

a) If a <2 t, then a <\/2 t. 

b) If t £ T , then (Via) < V 2 {t := r}a. 

c) If a <\/2 t and t is not free in a, then a <\/2 (V£t). 

ii) The relation <\/2,i between T\/2 and Ti is the least relation closed 
under the rule: 
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a) If a <V2 n for all i G I, then a <V2,1 (Aie/ 1 " 8 ')- 

The rules for <\/2 express the intuition that a type is a subtype of its in- 
stances. They are equivalent to the following rule, similar to ML's notion of 
generic instance: 

• If \s := p}a <2 t, where p is a vector of simple types, and the type 
variables t are not free in (Vsir), then Vs*<r <\/2 Vir. 

Note that we only allow instantiation of simple types. This ensures that 
instantiation does not take us beyond rank 2. It also has less desirable 
implications, e.g., (Vt.t) is not a least type in the ordering <\/2 : (Vt.t) ^\/2 
(si A (si -► s 2 )) -► s 2 - 

The relation <\/2,i is n °t a partial order; it is not even reflexive. This is 
because it relates types "across rank." Note that in a comparison 

(Vir) < V2 ,i (/\<Ti), 
iei 

the variable t may be instantiated differently for each <7 8 -. 

Some basic properties of <\/2 and <\/2,i are summarized in the following 
lemma. 

Lemma 47 

i) Ifcr,T G To, then a <\/2 t iff & <V2,i t iff a = t. 

ii) Ifcr,T G T2, then ct <\/2 r J 'ff ^ <2 r - 

iii) If a <\/2 7", then (Vtu) <\/2 (Vir). 

iv) If it G T2 and r G To, then Vtu <\/2 7" iff for some substitution S with 
dom(S) C t, we have 5(7 = r. 

v) For any substitution S and types ct,t G T\/2, if So <\/2 7", then 

S(Vt<j) <V2 7". 

vi) For any substitution S, types ct,t G T\/2, and type environment A, if 
Sa <V2 7", then S(Gen(A,a)) <V2 7". 

vii) If ^ <V2 cr 2 <V2,1 ^3 <i cr 4 , then cri <v2,i ^4- 

The typing rules of the system are presented in Figure 7. We write 
P2 > A h M : a if the judgment A \- M : a follows by these rules, with 
types appearing in type environments restricted to Ti, and derived types 
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(var) A x U{x :(/\ ieI Ti)}\- x :T io (where i G /) 

A x U {x : a} h M : r 



ABS 



Ah (Aa?M) : d^ r 


A h M : a' ^ r A h iV : a 


Ah (MJV) : r 


AV M :t 



j± r ivi . u — *■ ; j± r i\ . u . . 

APP) t-j — , , (where a <\/ 2 1 ° ) 



(sub) ; (where Gen(A,r) < V2 ct) 



Figure 7: Typing rules of P 2 . Types in type environments are in Ti, and 
derived types are in T\/ 2 . 



restricted to T\/ 2 . Note that the familiar rules (inst) and (gen) are special 
cases of the rule (sub): 

A h M : Vta , , m , 

(INST) AhM:{t:=T}a (where rG To) 

, , A h M : a . , . . ,. 

(GEN) — — — — — — (where t is not free m A) 

y J A\- M :\fta y ' 

The ordering < of Dehnition 28 is extended to pairs with T\/ 2 types as 
follows: 

(-4, o) < (A' ', ct') if and only if A' < x A and ct < V2 <?' ■ 

Lemma 48 (Weakening for P 2 ) 7/P 2 > A h M : ct and (A, a) < {A 1 , a'}, 
then P 2 > A' h M : ct'. 

Lemma 49 (Substitutivity for P 2 ) J/P 2 >ihM: ct, tfien P 2 > SA h 
M : 5(7 /or any substitution S . 

5.2 Extending subtype satisfaction 

In order to perform type inference for P 2 , we will need to solve problems 
that generalize the < 2j i-satisfaction problems of §3.1. 

A <\/2,i-satisfaction problem it is a pair 3s. P, where P is a finite set 
whose every element is either: 1) an equality between simple types; or 2) 
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an inequality between a TV 2 type and a Ti type. A substitution S is a 
solution to 3s. P if there is a substitution S' such that S(t) = S'(t) for all 
t (j£ s, S'a <V2,1 5' / ' r f° r an inequalities (a < r) £ P, and 5 'a = 5'r for all 
equalities (a = r) £ P. 

Note that any <2,i-satisfaction problem is a <\/2,i-satisfaction problem 
with the same set of solutions. Therefore we abuse notation and write 
Solutions(7r), MGS(7r), and MGS(7r)[T / F] for the solutions, most general 
solutions, and most general solutions away from W of a <\/2,i-satisfaction 
problem it. 

Similarly, <\/2,i-satisfaction problems can be solved by extending the 
transformational algorithm of Figure 6 by the following rule: 

(Via) < t => 3t{a < r} 

if t is not a A-type, and t is not free in r 

Theorem 50 Every <V2,1- satisfaction problem is equivalent to a unification 
problem, and moreover, there is an algorithm that transforms every <\/2,i- 
satisfaction problem into an equivalent unification problem. 

Proof: We show that the rules of Figure 6, augmented by the rule above, 
constitute an algorithm for converting any <\/2,i-satisfaction problem into 
an equivalent unification problem (equalities are between simple types, in- 
equalities are between T\/2 and Ti types). 

First, note that every rule transforms a <\/2,i-satisfaction problem into 
another <\/2,i-satisfaction problem. 

Second, note that each rule preserves the set of solutions, so that each 
application of a rule transforms a problem into an equivalent problem. 

Third, note that repeated application of these rules must halt: every rule 
reduces the number of type constructors ('—►', 'A', or 'V') in inequalities or 
reduces the number of inequalities. 

Finally, note that a normal form contains no inequalities, and is therefore 
a unification problem. □ 

Corollary 51 Let it be a <V2,1- satisfaction problem and W be a finite set 
of type variables. 

i) Solutions(Tr) = iff MGS(vr) = iff MGS(ir)[W] = 0. 

ii) There is an algorithm that decides whether it has a solution, and, if 
so, returns an element of ~M.GS(ir)[W]. 

Theorem 52 The subtyping relation <\/2,i is decidable. 
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Proof: To see whether a <\/2,i r , compute U G MGS({<7 < r}) and check 
to see whether U is the identity substitution. □ 

5.3 Type inference for P 2 

Definition 53 For any term M, we define the set PPp 2 (M) by induction 
on M. 

i) If M = x, then for any type variable t, ({x : t},t) G PPp 2 (a;). 

ii) If M = XxN, and (A,\/sa) G PPp 2 (iV), where the type variables s are 
distinct from all other type variables, then: 

a) If x G - dom(A), and t is a type variable not appearing in (A, Vsir), 
then (A,Vt?(i -► ct)) G PPp 2 (Aa;iV). 

b) If a; G dom(A), then (A x , Ge^A^, A(x) -? a)) G PPp 2 (Aa;iV). 

iii) If M = M 1 M 2 , the pairs (Ai,Vs<Ti) G PPp 2 (Mi) and (A 2 ,a 2 ) G 
PPp 2 (M2) are disjoint, and the type variables s are distinct from all 
other type variables, then: 

a) If <7i is a type variable t, t\ and t 2 are fresh type variables, U G 
MGS({ff 2 <h,t = t 1 ^ t 2 }), and A = [/"(Ai + A 2 ), then 

(A,Gen(A,^i 2 )) G PPp 2 (M). 

b) If <n = n -+ r 2 , U G MGS({d 2 < n}), and A = U{Ai. + A 2 ), 
then 

(A,Gen(A,C/r 2 ))GPPp 2 (M). 

Just as with I|, the elements of PPp 2 (M) are trivial variants of each 
other, so Definition 53 can easily be adapted to a type inference algorithm. 

Lemma 54 

i) If {A, a) G PPp 2 (M), then x G dom(A) if and only if x is free in M. 

ii) Suppose (Ai, cti) £PPp 2 (M). Tien (A 2 , o 2 ) G PPp 2 (M) if and only if 
there is a bijection R of type variables such that R(A\, G\) = {A 2 , a 2 ). 

Proof: An easy induction on Definition 53. □ 

Theorem 55 There is an algorithm that decides, for any M , whether the 
set PPp 2 (M) is empty; and furthermore, «/PPp 2 (M) is not empty, it pro- 
duces a member o/PPp 2 (M). 
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Proof: Just follow the rules of Definition 53, generating "fresh" type vari- 
ables as necessary, and use the algorithm of Corollary 51 to compute MGS. 

□ 

We now establish the soundness of PPp 2 . 

Theorem 56 If {A, a) G PPp 2 (M), then (A, a) G APp 2 (M). 

Proof: By induction on the definition of PPp 2 (M). 

i) If M = x, then {A, a) = ({x : t},t) for some type variable t. 
Then we have {A, a) G APp 2 (a;) by rule (var). 

ii) If M = XxN , then by Lemma 54(i) we have the following two cases: 

a) x is not free in N, (A,\/scr') G PPp 2 (iV) for some a', and a = 
\/ts(t — ► a') for some fresh type variable t. 

By induction, (A,\/sa') G APp 2 (iV), and by weakening, 

(AU{x :t},a') G APp 2 (iV) 

(note that A U {x : t} is well-formed by Lemma 54(i)). 

So by rule (abs), (A,t —> a'} G APp 2 (Aa;iV), and by then by 

rule (sub), 

{A,Vts(t -> a')} = {A,a} G APp 2 (Aa;iV). 

b) x is free in N and {A, a) = (A' x ,Gen(A' x ,A'(x) -^ cr 1 )), where 
(A',VsV) GPPp 2 (iV). 

By induction and rule (sub), {A 1 , a'} G APp 2 (iV), so by rule (abs), 
(A' x ,A'{x) -+ a') G APp 2 (Aa;iV). Then by rule (sub), 

(A' x ,Gen(A' x ,A'(x) -> (J 1 )) = (A, a) G APp 2 (Aa;iV). 

iii) If M = M 1 M 2 , then we have disjoint pairs (Ai,Vs<Ti) G PPp 2 (Mi) 
and (A 2 ,o- 2 ) G PPp 2 (M 2 ). By induction, (Ai,Vs<Ti) G APp 2 (Mi) and 
(A 2 ,<t 2 ) G APp 2 (M 2 ). By rule (sub), we have (A 1 ,<r 1 ) G APp 2 (Mi). 

We now consider two cases. 

a) If <7i is a type variable t, then we must have A = U(A\ + A 2 ) and 
a = Gen(A, Ut 2 ), where U G MGS({ct 2 < t u t = t x -+ t 2 }) for 
fresh type variables t\ and t 2 . 
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By substitutivity we have 

U{A u g 1 ) = (UA U (Uh) -+ (Ut 2 )) £ APp 2 (M!) 

and 

U(A 2 ,a 2 ) = (UA 2 ,Ua 2 )eAF F2 (M 2 ). 

By weakening, 

(UA 1 + UA 2 ,(Uh) -+ (Ut 2 )) £ APp 2 (M!) 

and 

(UAi + UA 2 , Ua 2 ) £ APp 2 (M 2 ). 

Then since Ua 2 <\/2,i Ut\, by rule (app) we have 

{UA 1 + UA 2 , Ut 2 ) = {A, Ut 2 ) e APp 2 (M!M 2 ). 
Then by rule (sub), 

(A,Gen(A,Ut 2 )) = {A, a) £ APp 2 (MiM 2 ). 
b) The case a\ = T\ —^ t 2 is almost identical to the last. 
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Theorem 57 (Principal pairs for P 2 ) If {A, a) £ APp 2 (M), then there 
is a pair [A 1 , a') £ PPp 2 (M) and a substitution S such that S(A',cr') < 
(A, a). 

Proof: By induction on the definition of APp 2 (M). 

i) If {A, a) £ APp 2 (M) by rule (var), then M = x for some variable x, 
A(x) = (/\i£i&i), and a = <7 8 - £ To for some %q £ /. 

By the definition of PPp 2 , {{x : t},t) £ PPp 2 (M), where t is a fresh 
type variable. 

Then {t := a} is a well-formed substitution and 

{t:=a}({x:t},t) = ({x:a},a)<(A,a). 

ii) If (A, a) £ APp 2 (M) by rule (abs), then M = XxN, a is of the form 
o-i -► a 2 , and (A x U {x : a 1 },a 2 ) £ APp 2 (iV). 

By induction, there is a substitution S' and pair (A',\/sa 2 ) £ PPp 2 (iV) 
such that 

S'{A'^sa' 2 )<{A x ij{x:a 1 },a 2 ). (2) 

We consider two cases. 
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a) If x G - dom(A'), then for any fresh type variable t, 

(A',Vts(t -> a' 2 )) G PPp 2 (Aa;iV). 

It remains to show that there is a substitution S such that 

S(A',Vts(t^ a' 2 )) < (A, a). 

Just let S = S'. By (2), we have A <i A x <\ S'A' , so we only 
need show 

S'(yts(t -^ Cr' 2 )) <V2 0"1 -^ d 2 . 

We can assume t, s are fresh, so that 

S'(Vts(t -^ (J 2 )) = Vts(t -+ S'a' 2 ). 

And by (2), 

{t := ai}\/s(t — ► S'a 2 ) = Vs(cri — ► S'a 2 ) <y 2 a x — ► <J 2 , 

so by the definition of <\/2 5 S'(\/ts(t — ► o^)) <\/2 01 - ► (7 2 as 
desired. 

b) If x G dom(A'), then (A^, Gen(A^, A'(x) -> a' 2 )} G PPp 2 (Aa:iV). 
Then by (2) and the definition of <, 

S'(A' x ,Gen(A' x ,A'(x) -+ a' 2 )} < {A^a, -+ a 2 ), 

and since A <i A x , we have S'(A' x ,Gen(A' x ,A'(x)^a 2 )) < 
{A,(7i — ► a 2 ), as desired. 

iii) If (A, a) G APp 2 (M) by rule (app), then M = M X M 2 , (A, ri -► a) G 
APp 2 (Mi), (A,r) G APp 2 (M 2 ), and r < V2 ,i r x . 

By induction, we have substitutions S\ and S 2 , and disjoint pairs 
(Ai,Vs» G PPp 2 (Mi) and (A 2 ,t') G PPp 2 (M 2 ), such that 

SMuVsp) < (A,n^a), (3) 

5*2(A 2 ,r') < (A,t). (4) 

We may assume without loss of generality that dom(5i), dom(5 , 2), 
and s* are disjoint. 

We now consider two subcases. 
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a) p is a type variable t. 

Let £i,/2 be fresh type variables, and let it = {r' < ii,i = t\ —^ 

By (3), we have 

S\(Vs.t) < V2 n -> a, 

and therefore, (S\ U .S^)/ <\/2 r i - ► ^ f° r some ^ with domain s. 
By the definition of <\/2 5 we must have 



(SiUS{)* = 


r[ - ct', 


ri <i 


r{, 


o' <V2 


o-, 


some rJ,(r'eTo. And by (4), 




S 2 t' <V2 T < 


V2.1 r l- 



Therefore, 5 = (Si U ^J U 5*2 U {ii := t{,^2 : = o 7 }) is a solution 
to IT. 

Pick £/ G MGS(vr), and let A' = U{A 1 + A 2 ). Then 
(A',Gen(A',Ut 2 )) G PPp 2 (M). 

By Convention 35, there exists an i? such that RA' = RU(Ai + 

A 2 ) = S(A 1 + A 2 ) and RUt 2 = St 2 . 

Since RA' = S^ + A 2 ) = ftAi + £2^2, we have A < x iEA'. 

And RUt 2 = St 2 = a' <y 2 <7, so by Lemma 47(vi), we have 

iE(Gen(A', Ut 2 ) < V2 a. 

Therefore, 

R(A',Gen(A',Ut 2 )) < (A, a), 

as desired. 

b) P = Pi— P2- 
By (3), we have 

S 1 (Vs.p 1 -> p 2 ) <V2 n -> a, 

and therefore, 

(S 1 US[)p 2 <V2 o-, 

ri <i (S'lUS'iK 
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for some S[ with domain s. And by (4), 

S 2 t' <V2 t <v2,i n- 

Therefore, S = (S\ U S[U S 2 ) is a solution to the problem it = 

{r 1 < Pi}- 

Pick U G MGS(vr), and let A' = U(A 1 + A 2 ). Then 

{A',Gen(A',Up 2 }) G PPp 2 (M). 

By Convention 35, there exists an R such that RA' = RU(Ai + 
A 2 ) = S(A t + A 2 ) and RUp 2 = Sp 2 . 
By (3) and (4), 

RA' = S(A 1 + A 2 ) = S\A 1 + S 2 A 2 < x A. 

And since RUp 2 = Sp 2 = (Si U S[)p 2 <y 2 u, by Lemma 47(vi) 

we have R(Gen(A' , Up 2 ) <y 2 a. 

Therefore, 

R(A',Gen(A',Up 2 )) < (A, a), 

as desired. 

iv) If (A, a) G APp 2 (M) by rule (sub), then for some a', we have a shorter 
derivation of {A, a') G APp 2 (M), and Gen(A, a') <y 2 a. 

By induction there is a pair [A 1 , a") G PPp 2 (M) and a substitution S 
such that S(A',a") < (A, a'). 

We now show that if t & FTV(A), then t & FTV(Sa"). Since Sa" < V2 
a', this implies Sa" <y 2 Gen(A, ct'), and therefore by transitivity, 
Sa" < V2 <t. 

Assume by way of contradiction that t (j£ FTV(A) and t G FT\(Sa"). 
Since A < x SA', FTV(SA') C FTV(A). Therefore, t <£ FTV(A) => 

t £ FTV(SA'). 

Since t £ FTV(SA') and t G FTV(Sa"), there must be some u G 
FTV(d") - FTV(A') such that t G FTN(Su). However, it is easily 
checked that (A 1 , a") G PPp 2 (M) => FTY(a") - FTV(A') = 0, so we 
have reached a contradiction. 
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The next result shows the strong connection between the systems 1 2 and 
P2: a term is typable in one system if and only if it is typable in the other. 
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Theorem 58 For any M , P 2 > A h M : \/ta for some t if and only if 
I 2 > Ah M : a. 

Proof: Each direction can be proved by induction on derivations. 

• I 2 =/- P 2 : The rules (var), (abs), and (sub) are trivial, so assume 
M = M 1 M 2 and I 2 >ih M 1 M 2 : a follows by rule (app). Then we 
must have 

I 2 >A\-M 1 :{j\ Ti )^o 
iei 
and 

(Vi G /) I 2 > A h M 2 : r,-. 

By induction we have 

iei 

and 

(Vi £/)P 2 >ihM 2 : r,-. 

By the principal typing property of P 2 , there is a pair (A', a') and 
substitution 5 such that 

P 2 > SA' h M 2 : SV, 

A <i SA', and iSV <\/2 T i f° r all i £ I- By weakening, 

P 2 t> A h M 2 : SV. 
Then by the P 2 rule (app), 

I 2 > A h M X M 2 : a. 

• P 2 =^ I 2 : The rules (var) and (abs) are trivial. 

If P 2 > A h M : Via follows by rule (sub), then we must have a shorter 
derivation of 

P 2 >ihM: Vsr, 

and Gen(A,Vsr) = (Vwr) <y 2 (Vic). We must show 

I 2 > A h M : o\ 

By induction, 

I 2 >ihl:r. 
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Furthermore, by the definition of <\/2 5 for some sequence p of sim- 
ple types, we have {u := p}r < 2 o. We may assume that the type 
variables u do not appear in A. Then by substitutivity, 

I 2 > A h M : {u := ^}r, 

and by the I 2 rule (sub), we have I 2 > A h M : <7, as desired. 

Otherwise, M = MiM 2 and P 2 > A h MiM 2 : a follows by rule (app). 
Then 

P 2 > A h Mi : ct' -^ ct 

and 

P 2 >ihM 2 : Vsr, 

where Vsr <\/2,i o' ■ By induction we have 

I 2 > A\- M 2 :t. 
If a' = /\i e i<Ji, then by the definition of <\/2,i an d by substitutivity, 

I 2 > A h M 2 : a,- 
for all i £ I. Then by the I 2 rule (app), 

P 2 > A h M X M 2 : a. 



6 Recursive definitions 

We now consider ways of typing recursive definitions. We extend the gram- 
mar of our language to include terms of the form (pxM). Such a term is 
meant to represent the program x such that x = M, where M may contain 
occurrences of x. 

In ML, recursive definitions are typed by the following rule: 

A x U {x : r) h M : r , , m , 

(REC-SIMPLE) A h : r (where rG To) 
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6.1 Recursive definitions in A 2 

In A2 and ML, the rule (rec-simple) seems overly restrictive. Both systems 
allow ML type schemes to appear in type environments and as derived types, 
suggesting the rule of polymorphic recursion: 

, , A x U \x : t\ \- M : r , , __ 

(rec-poly) A I { J M) : t (where r e S(l)) 

Example 59 When extended by (rec-poly), both ML and A 2 can type 
the following terms: 

(jiw.(\xyz.z)(w 3)(w true)) : \ft.t — ► t, 



(jix.xx) : \ft.t. 

Neither is typable with the rule (rec-simple). Other examples are given 
by Mycroft [25] and Kfoury et al. [13, 15], who introduced (rec-poly) 
independently. 

Unfortunately, type inference for A2 or ML extended by (rec-poly) is un- 
decidable [14, 9], so (rec-simple) is used in practice. 

6.2 Recursive definitions in I 2 

The rule (rec-simple) is one way of typing recursive definitions in intersec- 
tion type systems. However, as with ML and A2, it seems overly restrictive. 
The rule (rec-poly) involves S(l) types, so it is not appropriate for the 
intersection type systems. Instead, we might consider a rule like the follow- 
ing: 

A x U {x : r)} h M : r 



A h (fixM) : t 



(where r £ TV 



Note that the full power of the rule is achieved only by allowing Ti derived 
types, so the rule is not compatible with the rank 2 intersection type systems 
that we have defined so far. However, the rule can be adapted to our systems 
as follows: 

, (Vie I) A x U{x:(A; pT Ti)}\- M :Ti , , . 
(REC-INT) J AhV^Vr- (where , EI) 
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The system I2 + (rec-int) can type the following terms: 
(jiw.(\xyz.z)(w 3)(w true)) : t —^ t, 



( jiw .(Xxy .y)(ww)) 



t, 



where r is any simple type. Neither term is typable in I2 + (rec-simple). 
The close connection between I2 and A2 casts some doubt on the de- 
cidability of the system I 2 + (rec-int). However, I 2 + (rec-int) cannot 
type all of the terms that can be typed by A2 + (rec-poly). For example, 
the term (jix.xx) cannot be typed in I2 + (rec-int). The decidability of 
I2 + (rec-int) is an open question. 

6.3 Recursive definitions in P 2 

The system P2 could be extended to type recursive definitions with either 
the rule (rec-simple) or the rule (rec-int) (the rule (rec-poly) is not 
appropriate since it requires S(l) types to appear in type environments). 
Surprisingly, however, we can do better: we now propose two rules, (rec) 
and (rec-vac), for typing recursive definitions in P2. The rules will allow 
use to type more terms than (rec-simple), and we retain principal typings 
and decidable type inference. We will also give a typing rule for mutually 
recursive definitions, of the form 



(letrec x\ = Mi , • • • , 5 

where the variables X{ are distinct. 
The typing rules are defined below. 

A x U {x : r} h M : a 



REC 



REC-VAC 



LETREC 



A h (jixM) : a 

A X V M :a 
A \- (fj,xM) : a 



M n in M), 



(where a <\/2 1 t) 



(where x is not free in M) 



A, 
(Vi<n) A x 



X\ "-X 



U {x 1 : ri,...,; 
n U{x t : ri,...,: 



: r n } h M : a 

: T n } h (fj,XiMi) : a l 



A h (letrec x x = M x , . . . , x n = M n in M) : a 

(where Vi < ra, 0{ <\/2,i T i) 

We write P^ > A h M : a if the judgment A\- M : a follows by the rules 
of P 2 and the rules (rec), (rec-vac), and (letrec), with types appearing 
in type environments restricted to Ti, and derived types restricted to T\/2- 
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The rule (rec-vac) is necessary to type terms like 

(jiw(\x.xx)) : \/s,t.s A (s — ► t) —^ t. 

In order to use the rule (rec) in this case, we would need a type r G Ti 
such that \/s,t.(s A (s — ► t)) — ► t <\/2,i T - There is no such type, because s 
and s — ► t cannot be unified. 

Note in the hypothesis of the rule (letrec), we are careful to type each 
definition Mi as a recursive but not mutually recursive definition. Thus at 
first, each Mi needs to satisfy only the constraints on X{ implied by the 
occurrences of X{ in M 8 - itself; constraints implied by occurrences in M or 
other Mj are satisfied second. In between, the type of M 8 - can be generalized. 

Example 60 

i) The following terms are typable in P^, but not in P2 + (rec-simple): 

(jiw.(\xyz.z)(w 3)(w true)) : \/t.t — ► t. 

( jiw .(Xxy .y)(ww)) : \/t.t — ► t. 

ii) The term (fix.xx) is not typable in P!f • It has type (Vi.i) in ML+(rec- 
poly) and A 2 +(rec-poly). 

Definition 61 The set PP p r(M) of principal pairs for a term M is defined 
just as PPp 2 , with the addition of the following clauses: 

iv) If M = (fixN) and (A, a) G PP p r(JV), then: 

a) If x i dom(A), then (A, a) G PP p r(M). 

b) If x G dom(A) and U G MGS(u < A(x)), 
then (UA x ,Gen(UA x ,Ua)} G PP p r(M). 

v) If M = (letrec x x = M x , . . . , x n = M n in M ), 
and (Ai,(Ti) G PP p r(/U2; 8 'M 8 ') for 1 < i < n, 

(Ao,cto)gPP p r(Mo), 

A' = A + Si< 8 '< n A 8 -, 

U G MGS({ui < A'(xi) I 1 < i < n, x % G dom(A')}), 

and A" = A' T , 

then {UA",Gen(UA",Ua )} G PP(M). 
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Theorem 62 If {A, a) G PP p r(M), then (A, a) G AP pR (M). 

Proof: By induction on the definition of PP p r(M). For the ruies of P2, 
see the proof of Theorem 56. We only need to consider the following cases. 

iv) If M = (jixN), we consider two cases. 

a) If x is not free in N, then {A, a) G PPpii(iV). By induction, 
{A, a) G APp R (iV),andbyrule (rec-vac), {A, a) G AP F B.(fj,xN). 

b) If x is free in N, then for some (A', a') G PP P R(iV) and U G 
MGS((t' < A'(x)), we have 

(A,a) = (UA' x ,Gen(UA' x ,Ua')). 

By induction, (A', a'} G AP pR (iV). Then (UA',Ua'} G AP pR (iV) 
by substitutivity. Since Ua' <\/2,i UA'(x), by rule (rec) we have 
{UA' x ,Ucr') G AP p k((j,xN). Finally by rule (sub), 

(UA' x ,Gen(UA' x ,Ua')) G AP pR (^iV). 

v) If M = (letrec x x = M x , . . . , x n = M n in M ), 
then (A, a) = (U A" , Gen(U A" , U a )} , where 
{Ai,(Ti) G FF P B.(fJ,XiMi) for 1 < i < n, 

(Ao,cto)gPP p r(Mo), 

A = A + Si< 8 '< n A 8 -, 

U G MGS({ui < A'(x t ) I 1 < i < n, x t G dom(A')}), 

and A" = A' _ . 

By induction, (A 8 ',<7 8 -) G AP p r(//2; 8 'M 8 ') for 1 < i < n, and (Ao,0o) £ 
AP p r(M ). 

By weakening and substitutivity, (UA',U(7i) G APpR( J us,M;) for 1 < 
i < n, and (CM', C/"ct ) G AP p r(M ). 

Then by rule (letrec), {UA",Ua ) G AP p r(M), and by (sub), 
(A,ct) = (£M",Gen(£M",£/c7o)) G AP p r(M). 
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Theorem 63 (Principal pairs for P^) If (A, a) G AP pR (M), then there 
is a pair (A', a') G PP p r(M) and a substitution S such that S(A',cr') < 
(A, a). 
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Proof: By induction on the definition of AP p r(M). For the ruies of P2, 
see the proof of Theorem 57. We only need to consider the following cases. 

v) If {A, a) G AP p r(M) by rule (rec-vac), then M = {jixN), x is not 
free in N, and (A x ,a) G AP pR (iV). 

By induction, we have a pair (A', a') G PPpR,(iV) and a substitution S 

such that S{A',cr') < (A x ,a) < {A, a). 

By Lemma 54(i), x G - dom(A'), so (A', a') G PPpR.(/ia:iV) as desired. 

vi) If {A, a) G AP p r(M) by rule (rec), then M = {jixN), and for some 
t G Ti, we have {A x U {x : t},g) G AP pR (iV) and a < V 2,i r. 

By induction, we have a pair (A', a') G PPpii(iV) and a substitution S 
such that 

S(A',a')<(A x U{x:r},a). (5) 

We consider two cases. 

a) If x G - dom(A'), then {A 1 , a') G FF p k((j,xN), and by (5), A < x 
A x <i 5 A' and 5 a 7 <\/2 c, as desired. 

b) If x G dom(A'), and r' = A'(x), then by (5), Sa' <\/2 ^ <V2,i 
r <i St', so 5 is a solution to 7r = {a' < r'}. 

Then pick U G MGS(vr), so that 

(UA' x ,Gen(UA' x ,Ua')) G PP P R(/xa;JV). 

By Convention 35, there exists an R such that RUA' X = SA' X and 
RUa' = Sa'. By (5), A < x SAJ. and by (5) and Lemma 47(vi), 
R(Gen(UA',Ua')) < V2 a. Therefore 

R(UA' x ,Gen(UA' x ,Ua')} < (A, a), 

as desired. 

vii) If {A, a) G AP p r(M) by rule (letrec), then for some x,N,f of 
length n, M = (letrec x = N in JV ), (Ag U {x:f},a) G AP pR (iVo), 
and (Ag U {if : r}, <7 8 -) G APpR^a^-iV;) and <7 8 - <\/2,i r « f° r all i < n. 

By induction, we have pairs (Ao,0o) ^ PPp R (^o) 5 an d (Ai,a[) G 
PP P R(;ua; 8 'iV 8 '), and substitutions So, S\, . . . , S n such that 

So(A ,(t' ) < (AgU{x:f},a), (6) 

Si(Ai,<r<) < (Ag\j{x:f},a % ) (7) 
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for all i < n. 

Let A' = A + ?l 1 <i< n Ai,S = S \JS X U • • -US n , and vr = {a[ < A'( Xi ) \ 
1 < i < n, x % G dom(A')}). 

By (7), if Xi G dom(A'), then 

Sio'i <V2 Oi <V2,i n <i A'(a:;). 

Therefore S is a solution to it. 

Pick £/ G MGS(vr) and let A" = A' T . Then 

(UA",Gen(UA",Ua' } G PP p r(M). 

By Convention 35, there exists an R such that RU A" = SA" and 
RUa' = Sa' . 

Then by (6) and (7), A <i SA", and Sa' <\/2 <?, as desired. 



□ 



Theorem 64 

i) If I 2 + (rec-int) > A\- M : a, then Pf > A\- M :cr. 

ii) If P 2 + (rec-int) >ihl:(j, then P^>ihM:(7. 

Proof: 

i) By induction on derivations. The cases for all the rules except (rec- 
int) are just as for Theorem 58, so assume that I 2 + (rec-int) >ih 
(pxM) : a holds by rule (rec-int). We must have 

(Vi el) I 2 >A x \J{x: A je/ tj} \-M-.Ti 

and a = r 8o for some %q G I. By induction, 

(Vi G /) P^ > A* U {x : A je/ r ; -}hM: r,-. 

By the principal pair property of P!f , there is a pair (A 1 , a'} and sub- 
stitution S such that 

A^ U {a; : /\j^jTj} <i 5 A', and Sa' <\/2 t« for all i e I. By weakening, 

P^ > A x U {a; : /\ Tj } h M : SV. 
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Then by rule (rec), 

Pf> Ah (jixM) : Sa', 

and by rule (sub), 

P^ > A h (jixM) : r io . 

ii) Identical to the last case. 

□ 

7 Compiling with rank 2 intersection types 

We briefly discuss some applications of rank 2 intersections in compilation. 

Polymorphism allows a function F of type \/t.t — ► t to be applied to 
arguments of any type. Unfortunately, it also requires that the data rep- 
resentation of its arguments be reduced to a lowest common denominator: 
the machine code for F cannot handle both a 32-bit integer in a general 
purpose register and a 64-bit floating point number in a float register. In 
practice, arguments are "boxed," or represented as a pointer to the actual 
data value stored in main memory. Boxing and unboxing coercions slow 
program execution. 

These overheads can be reduced when more is known about the uses of 
the polymorphic function. For example, consider the program 

M = (A/.(/3,/true))F. 

A naive implementation would insert instructions to box the arguments 3 
and true before passing them to F. A more clever implementation would 
recognize that the only arguments of F are integers and booleans, both of 
which can be represented in a single 32-bit register; so F could be compiled 
to expect an unboxed value as its argument. 

This can easily be achieved in P2. To compile M, we first calculate the 
principal typings of the operator and operand: 

(A/.(/ 3, / true)) : Vs, M.(int — ► s) A (bool — ► u) —^ s X u, 
F : Vt.t -► t. 

The type of the operator indicates that F will only be applied to integers 
and booleans, and the compiler can take advantage of this in generating the 
machine code for F. Note that this improves on Bj0rner's minimal typing 
derivations [3], which would require the arguments to be boxed. 
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P2 also supports other data representation strategies. For example, in 
compiling the program (A/.(/ 3,/ 2.4))_F, we will calculate the principal 
typing 

(A/.(/3,/2.4)) : Vs,w.(int -► s) A (float -► w) -► s X w. 

If floating point numbers are 64-bit values, we can't just compile _F to expect 
its argument in a 32-bit register, as before. Boxing is one solution. But 
another solution is possible: specialization [8]. We can generate two versions 
of F, one expecting an unboxed integer in a 32-bit register, and one expecting 
an unboxed float in a 64-bit register. We are essentially overloading the 
variable /, so the application (/ 3) invokes the integer-expecting F, and 
(/ 2.4) invokes the float-expecting F. 

8 Conclusion 

We discussed a variety of rank 2 type systems: A2, the rank 2 fragment 
of System F; I2, I|, and I^, all variants of the rank 2 intersection type 
discipline; and P2, which adds ML-style, top-level quantification of type 
variables to I2. We showed that all of the systems are equivalent in terms 
of typability — a term is typable in one system if and only if it is typable in 
another. An immediate corollary is that typability in all of these systems is 
DEXPTIME-complete. We have also determined that the sequence I2 , I|, 
I2, P2 is in order of increasing "expressiveness." For example, a judgment 
of I^ is a judgment of P2, but not vice versa. 

We proposed a new rule for typing recursive definitions that can type 
many examples of polymorphic recursion. The extension of P2 by this rule 
results in a system with principal typings and decidable type inference. 

Finally, we discussed some applications of intersections in compilation. 
The finite polymorphism of intersections expresses data representation con- 
straints more accurately than polymorphism by quantification. The accu- 
rate expression of these constraints leads to data representations that require 
fewer boxing and unboxing coercions at runtime. 
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